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ABSTRACT 

This study matched teacher and professor perspectives 
of preservice educational measurement courses. Twenty-eight 
professors from different colleges in seven states and 377 teachers 
from elementary and secondary schools in one iiiidwestern state 
responded via knailed questionnaire in which: professors were asked to 
assess the emphasis they give to topics in preservice educational 
measurement courses, and precollege teachers were asked to state the 
emphasis they believe should be given to the topics. Results of the 
survey show that both groups gave high ratings to items on 
preparation of exams and low ratings related to the legal aspects of 
educational measurement. In other categories, their priorities 
differed. Teacher priorities were testing activities (the practical 
concerns of testing), nontest evaluative activities, and application 
of test results. Professor priorities were statistics, standardized 
test issues, and general use of jxam results. Implications of these 
results are discussed. (Author/BW) 
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Abstract 

This study matched teacher and professor perspectives 
o-f preservice educational measurement courses- Twenty- 
eighit professors -from di-f-ferent colleges in seven 
states and 377 teachers f rom el ement ary and secondary 
schools in one midwestern state responded via mailed 
questionnaire in which: professors were asked to assess the 
einphasis they give to topics in preservice educational 
measurement courses, and precollege teachers were asked to 
state the emphasis they believe should be given to the 
topics- Results o-f the survey show that professor and 
teacher priorities match on one topic area, preparation 
of exams- In other areas, (statistics, standardised 
tests, nontest evaluation activities, and use of test 
rt^sLilts) their prioritites differed substan\^i :llv- 
I :np I i ca t i ons of those results are discussed- 
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Match! ng Teacher Trai n! ng wi th 



Teacher Needs ! n Testing 



/ Conant (1963) sparked debate over the preparation 
o-f /teachers , and in particular raised questions as to 



mt^c^sur ements. Discussion ensued, and in a 1964 
National Council on Measurement in Education (NCME) 
Syrnposit.iiTi Mayo (1964) reported measurement experts' 
expectations as to what teachers should be taught. 
Subsequent surveys (Goslin, 1967; Mayo, 1967; Roeder , 
1972) established that teacher preparation in 
educational measurements did not meet these 
expectations. Stimulated by these bindings, many 
ucii 1 eges , and universities introduced preservice 
educational measurement courses. 

fljK'.e implemented, questions began to arise 
regarding the content o-f these courses. As early as 
1 777. f ar r ar jd Gr i -f -f i n ar qued that too much emphasi s was 
benng placed on standardized tests. More recently, 
r!--f7earc.h fiiidittgs have rai^^ed questions about the 
t our in general, and specific content issues as well. 
P'l -r ( 1. fitt'H researcl") findings includes teachers report 
••allege cuur*:ies do not propf^rly prt^pare them in the i.t^ice 
'•'if M'lSt.rtu. t. t cmal te^:■>^ing (R(tl 1 i c(.^^ar^ , in press); too 
1 X t *: ! e emptiasis is placed ai) per f <jr maru: e based 




wfiat sf^ould be teac'iers' preparation in educational 
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a5BeB5meiit (Stiggins S< Br i dge?f ord ) ; teachers do not 
apply the statistical concepts taught in educational 
measurement courses (Gullickson EHwein, 1983); and 
tearhers ma»:e only minor use o^ standardized tests for 
i nstr ucti onal purposes (Gul 1 i ckson , 1983) . 

None of these studies expressly focused on what 
topics are covered in educational measurement courses 
and what emphasis should be given to the various 
topics. Thus, though the previous research suggests 
dissatisfaction with collegiate instruction efforts, 
it does not provide specific direction for improving 
the^ match between teacher needs and instructional 
i-.ontent and emphasis- This study focused on that issue 
through a survey of both precollege teachers and 
col 1 r»ge prof essors- 

The objective was to identify teacher and 
pr^jfessor priorities for these courses and, through 
r ..tri 1- r cist of th^eir respective priorities, identify 
c^iruiges which would lead to a match bs^tween perceived 
tt'.n ^le?r need<r. and mc^asur ement instruction- To do this 
bofhi gr (jups v-^er f? presented with a comprehensive list cH 
^•jpirT:^ app 1 I c '-ib 1 to a pre^^ervice course in educational 
iTir.^<;f3ur efn^-^n t arai -/a ] tia 1 1 on • Professors were? as^ ed t(» 
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were asked to rate the emphasis they believe shoul d be 
given to the topics- 
Method 

Subjects were elementary and secondary teachers 
from a midwestern state, and teachers of 
preservice educational measurement courses (hereafter 
called professors) from that state and the six 
contiguous states. Four hundred fifty teachers were 
randomly sampled from the state directory file of 
teachers in public and private schools. Previous 
research (Gullickson, 1983) with this population 
^las B^iOwri that virtually all these teachers have 
a baccalaureate degree, the large majority received 
t^leir training eithier in the state or in a contiguous 
^tate, arid over 90/C have had at least one course xn 
educ a t i onal measur ement s - 

Jo abt<^ i ri a s^amp 1 e of pr of e ssor s one-^th i r d < ) 
the colleges in the seven state region were 
-.y->it?mat: 1 c:al 1 y s:>ampled from the papulation of col 1 eqe^, 
and universities which give baccalaureate degrees in 
F-1 emF?r\ t ar y and seccuuiary educatioru Names of 
ifidi vicinal pr of esisor" were obtaifkfuJ from thp resppctivF* 
i it^U. 1 I n 1 1 ons through telephone calls to dep)artments arid 
I } {♦.} i V 1 dna 1 i> at e?vHcr\ i r^<^^ t i t u t i on . 
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Procedures 

Survey questionnaires, each with a caver letter 
v^hich described the intended research and solicited the 
teacher's response, were mailed to 450 teachers. 
Subsequently, at one and one~-hal-f week intervals, 
nonr e^^pondent teachers were mailed a reminder post 
card, a second letter with a second copy of the survey 
•form, and finally a second reminder post card. 

The designated professor at each institution was 
first called to solicit his/her participation in the 
survey, and then each was mailed a surve/ form. 
Nan -r esponder it professors were subsequently sent a 
fallow up reminder letter - 

Professors were asked to respond to a variety of 
• ^ue^t I nns regarding their educational background, 
i.:hcir-ac ter 1 st 1 cs af the preservice educational 
measurements course, and content of the preservice 
jfjec^sur ement course. Teachers were asked only their 
tt.-aLhing e); pe*r i er\ce , their grade and curricular areas 
i;f I nst r uc t: 1 on , afuJ hhe parallel items to the 
{.•rofe-i^.or ' content of preservice measurement cour s^f' 
« 1' • or^s. 

T^li-L fjaper deals directly with tln^ 6 7 c. our sn 
i}*!-^r\f 1 t f^^ms wh.ich the tv-jn quf^sfr i ruin.^i r e*^^ in 
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common. The items (listed in Table 1) provide a 
comprehensive list of topics common to most 
undergraduate courses- The list was developed through 
analysis of textbooks (e.g., Hopkins & Stanley, 1981), 
and included topics identified by Rudman et. al . (1980) 
relevant for measurement and evaluation courses- 
Professors f rom three nearby colleges reviewed the list 
fur completeness. The items were blocked into eight 
categories: general assessment information, preparing 
examinations, administering and scoring tests, 
employing other evaluative devices, computing and 
interpreting statistical data, using \.est results for 
planning (formative evaluation) purposes, using test 
results for summative evaluation purposes, and testing 
and the law — legal challenges to test practices. 

Teachers were asked to indicate the relative 
emphasis they believe should be placed on each of the 
topic areas, and professors were asked to respond 
regarding the emphasis they give to the topic in their 
instruction of the pre service measurement course. Both 
teachers and professors were asked to rate the items on 
a six point basis where O no emphasis, i.e., not 
addressed; 1 = very slight emphasis: 2 = slightly 
greater emphasis; etc., up to 5 = very great emphasis- 
Three hundred seventy-seven <84/C) ai the teachers. 
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and twenty--el/^ht (SSX) o-f the pro-fessors respondedp 
Among the professors ijiha responded, two indicated that 
their college offered no distinct measurement course, 
and consequently they did not complete that section of 
the questionnaire. Therefore a total of 26 colleges 
are represented by the professor ratings. 

Results 

Teacher and Professor responses were summarized to 
provide item meana and standard deviations for each ^ 
group (Table 1). These item means were then rank 
ordered for each group. Finally the item mean 
differences between teacher and professor ratings were 
calculated and rank ordered. The top twenty ranked 
items from each list were then displayed in Tables 2-4. 
Tables 2 and 3 show the respective teacher and 
professor priorities, while Table 4 depicts the major 
discrepancies between the two groups. 



Insert Table 1 about here 



As can be seen in Table 1, teachers tended to rate 
i terns si ightly more highly and were slightly more 
homageneous in their average ratings. Both groitps gave 
hiqt^ rating to items in the category of Preparing Exams 
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and low ratings to items related to the legal aspects 
of educational measurements. In other categories their 
pri or ities di-f-fered. 

Teacher priorities (Table 2), break naturally into 
three categories and are grouped accordingly as: 
testing activities (nine items), nontest evaluative 
activities (four items), and application of test 
results (seven items). The first category, testing 
activities, depicts practical concerns of testing. 
These items refer to content definition, selecting 
and/or writing test items, the administration of the 
test, and scoring test results. 



Insert Table 2 About Here 



Among the nontest activities class discussion 
and observing work habits stand out as particularly 
important because: a) teachers rated these items very 
highly; the average for each was above 4.0, and b) of 
al 1 the 67 i terns teacher and professor rati ngs were 
most discrepant for these two. Clearly teachers value 
thefse practices much more highly than do professors. 

Application of test results directly relates to 
decisions which a teacher could/would make based upon 
tt?sl results. Three of these items (items ranked 11, 14, 
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19) are instruction oriented; four (itemG ranked 3, 6, 
7, 17) are student oriented. In composite they 
indicate teacher interest in planning coursework r^nd 
follow-up based on test results to improve student 
learning. In particular teachers appear to prioritise 
use of tests to get at "out 1 i er character i sti cs" both 
for individual students (e.g., strengths/weaknesses) and 
for student groups within the classroom (e.g., gifted, 
slow, and underachieving learners). 

Professor priorities (Table 3) divided into four 
categories: statistics (six items), preparation of 
exams (e-i ght items), s^tandardi zed test issues (four 
items), and general use of exam results (two items), 
□f the four, professors give greatest emphasis to the 



instruction of statistics and preparation of exams. 
Those two comprise all of the top ten rated items. 



Where teachers included no 5i>tat i st i cal items in 
their top twenty, professors included five in their top 
th?r> . Essent i al 1 y every major stat i. st i cal topi c 
presented in standard measurement and evaluation 

t f>;; t hor^r s i s r epr*esentecj on the professors ' 1 i st a-r> 



Insert Table 3 about here 
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having a high priority. 

Li ke teachers, professors prioritize the 
preparation of exams. However , professors gi ve 
priority to only the front end of the testing process, 
i.e., preparation. None of the priority items refer to 
the administration or scoring of tests. 

^Two items in this category deserve special 
note. These two items, ranked 8 and 9, speak respectively 
to the comparative merit of items (taxonomy levels), and 
the comparative merits of testing strategies (objective 
vs. essay tests). Neither item was among the 
top 20 for teachers. In fact the first of the two, 
which speaks directly to skill and taxonomy levels, is 
one of the items for which teacher and professor 
ratings are most discrepant. 

Professors prioritise each of the three major 
standardized test issues: norms, reliability, and 
validity- These testing issues, when combined with 
the topic of statistics (which is central to the 
interpretation of standardised tests), result in a 
substantial emphasis an standardized testing concerns. 

The professors' application of test results category 
corrtains substantially fewer items than did the 
teachers', and is substantially different ^n content. 
Where teachers [:ir i or i 1 1 zed several specific 
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applications, professors included none. Instead 
professors included only items which refer in general 
terms to the use of test results for formative and 
summative evaluation purposes- This suggests an 
interest in the general area but either specific topics 
relevant to the professors were not included on the 
instrument or professors focus on providing a general 
overview rather than application of test results to 
speci f ic concerns - 

The major discrepancies. Table 4, complement the 
two priority lists. Here many of the individual items 
are different from those included in Tables 2 and 3 but 
the areas of major discrepancies are quite consistent 
with the differential priority patterns noted above. 



Insert Table 4 about here 



Examination of the discrepant items reveals four 
major categories: standardized tests, statistics, 
nontest evaluative activities, and use of test results. 
All four »^»prp5ent priority categories for either 
professors or teachers. Two remaining items, ranked 12 
and 16 do not clearly represent a single category. One 
regarding taxonomy levels was noted previously, as one 
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of the professor priorities within the preparing exams 
category. The other appears to suggest teachers place 
a hi gher pr i or i ty on 1 earni ng about resource 
i-nf ormation- 

The categories with greatest definition and 
direction were nontest evaluative activities and 
statistics. Three of the four nontest evaluation items 
listed as priorities by teachers appear in this list. 
Similarly of the five statistical concepts which appear 
in this list, four were listed as prioriti'ss by 
professors. (The fifth, though it has a clear 
statistical connotation, was included in the 
questionnaire in the Preparing Examinations section.) 

Both categories reemphasize the difference in 
teacher and professor priorities. Teachers value 
nontest activities but not statistics; for professors 
the opposite is true. 

Use of test results emerged as the third major 
category of discrepancy. Only one of these items was 
also listed as a priority. But these items confirm 
what was evident in the prigrity tables. Teachers 
place a much higher priority on learning to use test 
results for specific decision situations. 

Of the four discrepant items related to 
standardized tests, two are professor priorities. The 
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/ 

/ / 
remaxning twoy though not teacher priorities, depict 

teacher pref-^rences. One re-fers to the use of test 

committees, the other regards the recommendation of 

tests to administrative personnel. In general, these 

four items suggest a professor focus on the aspects of 

standardized tests which are necessary for determining 

which tests +^0 use, and a teacher focus on practical 

issues which teachers face in the selection of 

standardized tests- 

Di scussi on 

Only for preparing examinations was there a clear 
match between instructional emphasis and perceived 
needs. Even there noteworthy di f f er ences emerged. 
Professors lean toward emphasis on fundamental and 
qualitative concepts (course objectives, comparative 
merits of tests, etc.). Teachers however seem to give 
higher priority to practical concerns: the preparation 
o-f test itoms, administration of tests, and scaring of 
tests. Both professors and teachers did prioritize use 
of test results, but professor priorities were focused 
at a very general level, and not on the more specific 
i ssues desi red by teachers. 

Teachers cl ear 1 y do not share professor priori t i es 
for statistics; this is consistent with previous survey 
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findings Mayo (1967). In -fact as already noted, 
despite its h^avy emphasis in measurement courses, 
teachers do pot use statistics. Teachers also give a 
lower priority to standardized test concerns. Again 
this is consistent with their minimal use of such tests 
for instructional purposes- 

Les^ obvious but important is the discrepancy 
on issue/^ of test validity. Teachers would choose to 
give less emphasis to test validity issues than do 
profesBors. This is evident in the questions which 
speak directly to validity and in other items which 
refer to the relative characteristics of tests and test 
items. 

These striking differences between course emphases 
and teacher preferences probably result from the 
differing conceptual frameworks and contextual 
constrai nts surroundi ng teachers and professors. Most 
teachers have a very 1 i mi ted formal preparati on i n 
educational measurements and evaluation; but, they have 
a substantial faith in the validity of test^ (Farr & 
Griffin, 1972). Also, because of the demands of their 
job 4 teachers extensively use tests and other 
evaluation techniques- Calberg (1980) estimates that 
approximately 327. of a teacher's time is vested in test 
related activities. They routinely prepare administer 
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and score their own exams, evaluate their students 
through the use^ o-f nontest techniques, and apply the 
results of th^ir tests and evaluations to students and 
their teaching situations (Gullickson, 1983). Such 
condi t i ons certainl y Mould stimulate an i nterest in 
practical application concerns- 
Professors, on the other hand, have a substantial 
background in educational measurements- (The group 
surveyed typically had from eight to 15 semester hours 
in educational measurement courses at the graduate 
level.) Typically such graduate courses emphasize 
topics such as standardized tests and statistics- 
Always, such courses place a heavy emphasis on test 
validity, and informal, nontest, approaches are 
frequently cited more for their validity problems than 
for their evaluation potential* 

In addition professors routinely teach meyasur«=*ment 
courses under adverse conditions- For example 
professors typically: a) have three semester hours or 
less in which to teach educational measurements and 
evaluation, b) teach all preservice teachers in a 
single group, or teach preservice elementary teachers 
i n one group and secondary teachers i n another , and c ) 
do not teach the course in the context of any 
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individual curriculum or even in the context of 
curricular content. Rather it is usually taught as a 
unique course focusing on facets of testing germane to 
all curricular areas, or it is taught as a part of an 
educational psychology course. 

In such circumstances it is easy to see why 
professors give priority to statistics and standardized 
tests. They have a strong background in both areas. 
Plus statistical concepts are important to the 
analysis and interpretation of both standardized tests 
and teacher made tests. Further, while class 
discussion and other nontest evaluation activities 
typically suffer from validity concerns, standardized 
tests (particularly achievement tests) tend to be 
valid. In addition standardized testing concerns are' 
readily gener^.lized across curricular area^, and 
published standardized tests can be used to exemplify 
attributes which tests should and should not have. 

Whi le the respective conditions help to explain 
why the discrepancy exists, they are not a rationale 
for maintenance of such discrepancies. Previous 
research suggests that teachers, in practice, ignore 
measurement prescription that does not meet their 
needs; and clearly, present courses are not meeting 
their perceived needs. 
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The results suggest teachers see their neede 
as practical in nature; there is little in the teacher 
priorities which point to conceptual coi/icerns. Thus, 
the problem professors face is to produce measurement 
courses whi ch fit teacher percei ved ne^ds yet are 
conceptually sound. To meet those constraints it 
appears measurement concepts will need to framed within 
practical application situations. 

Presently statistics and standardized tests retain 
a central role with implications drawn from them for 
teacher made tests and evaluations. Given the present 
findings^ it seems appropriate to reverse the emphases. 
That is teacher made tests and :jther evaluative devices 
can be given central focus, and standardized tests and 
statistics can be included as supporting tools. 

Such a shift would have substantial implications 
for instruction of validity concepts. Validity 
has routinely been addressed from the context of 
standardized tests, and the majority of research on 
thi^i topic is related to standardized test use. To 
date measur e.rtent experts have simp 1 y assumed 
standardized tes>ts and teacher made evaluation 
techniques share the same validity concerns. There is 
therefore a need to gain more information about teacher 
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evaluation practices: to identify typical errors, to 
identify strategies to deal with these errors with' n 
the limits and resources of the teaching situatitv-^ and 
to determine the proper emphasis to give the t *c in 
preservice teacher preparation cources. 

Finally, it seems appropriate to change the 
context within which the educational measurements 
course is offered. Actually, the question of course 
format has been an issue from the outset. Findley 
(1964) argued, in the 1964 NCME symposium, for 
instruction of educational measurements not as a 
separate course but as a part of the methods course in 
the curricular content areas. If the "neasurement 
content were included in those courses, instructional 
groups would be more homogeneous and measurement 
concepts would be taught within the context of content 
areas to be eval uated- 
Unf or tunatei y such action would be likely to 
"create** di f f ©rent problems. Personal exper i ence 
suggests that few professors, who teach teaching 
methods for the various content areas, are skilled in 
measurement and evaluation methodology. For example, 
at the local university a check with the computer 
center revealed that none of the *' methods professors" 
ever" ask for an item analysis of their tests (this 
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service is provided free of charge to any professor vgho 
requests i t ) • 

Given these circumstances, it may be appropriate to 
teach educational measurement courses in tandem with 
the methods courses so that the general ideas gained 
from the measurement course can be given specific 
application in the methods course. Or alternatively, 
professors of educational measurements and evaluation 
could provide inservice to methods instructors who 
could then carry out preservice measurement/evaluation 
instruction^ Both actions require greater coordination 
of instruction than is typical at the college level, 
but both seem likely to yield a better match with 
precollege teacher needs. 

Uniformly these suggestion have called for changes 
at the college level to make the courses coincide with 
teacher perceived needs. This does not suggest that 
teachers properly perceive their own needs. Almost 
certainly other and different steps will have to be 
taken before educational measurement courses truly 
match teachers needs. Yet, these steps seem to be an 
appropr i ate begi nni ng . 
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Table 1 

Teacher and Professor Means and Standard Devi ati ons 



Teachers 


Professors 


Iten) Description 


M 


S 


M 


S 




3 • 36 


1. 10 


3. 10 


1.07 


1. General Assessment Inf arrif^ti on 


3.20 


1. 13 


2.95 


1.23 


A. Sources of aid in interpreting and ^ 
usi ng assessment in-f ormat ion 


3. 19 


1. 21 


2.50 


1.25 


1. interpretative manuals and devices 
which accompany published tests 


2.96 


1. 17 


2.08 


1.38 


2. orientation and instructional in- 



formation available to teachers 
through publishers and major or- 
ganizations, e.g. , Psychological 
Corporation and the National Edu- 
cational Asuoci at ion 

3- la 1.26 3.72 0.89 B. Selection and us;? of standardized and 

publ i sher prepared tests 

3.50 K24 3. 13 1,32 1. identifying tests suitable for 

particular age or grade groups 

3.67 1.14 3.17 1.40 2. determining appropriateness of 

test content for specific classes 

3.56 1.24 3.50 1.29 3. evaluating tests in terms of reli- 

ability and val 1 di ty 
(tab 1 e cont i nues) 
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Teachers 



M 



PrQ-Fessor* 
M S 



2.77 1.32 1.83 1.40 



2.93 


1 . 


15 


3. 


67 


1 . 43 


3. 44 


1 . 


24 


2. 


88 


1.73 


4. 10 


1. 


06 


4. 


00 


1 • 33 


3.55 


1 . 


07 


3. 


38 


1.40 


2.50 


1 . 




1 . 


71 


1 . 46 


3.69 


1 . 


17 




71 


1.68 


2.89 


1 . 


16 


3. 


67 


1.34 


3.58 


1 . 


1 1 


3. 


79 


1.25 


2.85 


1 . 


18 


3 m 


79 


1.47 





90 


1 


. 08 


■J" . 


96 


1 


. 37 


3. 


X.1 _> 


1 


. 17 




63 


1 


. 74 




82 


1 


. 03 


•Ij m 


74 


1 


.59 




80 


1 


.08 


~r 

■J> m 


54 


1 


.53 


3 • 


86 


1 


. 01 


3 . 


75 


1 


. 45 
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Item Description 

4. recommending tests to admini«stra- 
tive personnel 

5. test norms and interpretation 
ba*5ed upon norms 

6. cr itical revi ew of i ndi vi dual 
tests 

2. Preparing Examinations - 

A. General development concerns 
1- wqrking on test committees 

2. defining course objectives 

3. norm referenced vs. criterion 
referenced tests 

4. objective vs. essay tests 

5. defining skills and taxonomy 
levels (e.g., Bloom's Taxonomy) 

6. writing test items 

7. deter mining scoring weights 

B. Item selection and construction 

1. writing subjective test items, 
e.g., essay and short answer items 

2. writing objective items, e.g., 
multiple choice and matching items 

( tabl e cont i nues) 
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Teachers 


Professors 




Item Description 


M 




s 


M 




s 






3. 12 


1 


. 15 


3.25 


1 


.39 




3. conducting an item analysis 


3.67 


1 


. 1 1 


3. 37 


1 


21 

• X 


3. Administering and Scoring Tests ^ 


3. 48 


1 


. 12 


3. 04 


1 


. 26 


a. 


Establishing test taking routines 


3. 37 


1 


-11 


2.91 


1 


36 


b. 


Administering informal tests 


3. 13 


1 


. 25 


2. 87 


1 




c . 


Administering standardized cests 


3 ■ <£}3 


1 


. 1 1 


3. 26 


1 




d. 


Scoring tests 


3. 14 


I 


. 24 


3 5'^ 


1 

X 


. o~ 


e. 


Transf ormi ng raw scores 


'^ 41 


1 
± 




70 


1 
1 






Record i ng test resul ts 


AO 

•J m tj w 


1 


Ol 


/lo 


i'\ 
\J 




4. Employing Other Evaluative Devices 




t 


1 1 

■1 ii i. 


O . 


1 
1 




a. 


Use of check lists and rating scales 
















to evaluate procedures and products 


3 4A 


1 

X 






1 
1 




b. 


Intervi swing pupils and parents 


4 Ol 


1 

X 


0 1 




1 
X 


■SO 


c . 


Observi ng work habits 




(") 


• V uJ 




1 

1 


A4 


d. 


CI asB di scussi ons 


3.68 


1 


. 17 


2. 17 


1 


. 64 


e. 


Interpersonal rel at i on ships 


2.67 


1 


. 16 


2. 22 


1 


.59 


f . 


Admi ni ster i ng soci ometr i c tests 


2 , 80 


1 


. 26 


2. 18 


1, 


. 37 


g- 


Wr i ting anecdotal reports 


2.81 


1 


. 12 


3.90 


0, 


.83 


5. Computing and Interpreting Statistical 














Data 


2.58 


1 , 


. 06 


4. 00 


1 , 


. 13 


a. 


Measures of central tendency and 



var i abi 1 i ty 

(table continues) 
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Teachers 


Professors 




I tern Descr i pt i on 


M 


S 


M 


S 






3.10 


1.13 


3 S3 


1 '7"^ 
± ■ .^o 


b. 


Percentages and percentiles 


2.92 


1. 12 


4.00 


1.00 


c. 


Standard scores and the normal dis- 
tribution 


2.87 


1.22 


3. 17 


1. 19 


d. 


Readi ng graphs 


2.57 


1. 18 


3.83 


1. 19 


e. 


Correlations and reliability coef— 
fi cents 


2.66 


1 .21 


3.04 


1.36 


f . 


Reading statistical reports of test 
resul ts 


3.76 


1.03 


3.65 


0.67 


6. Using Test Results -far Planning (Forma— 



4.05 0.99 3.39 0.94 



3.63 1.19 



3.11 1.24 



;.04 1.30 



3.68 1.06 3.04 1.40 
3- 48 1.00 2.78 1.24 

3.40 1.27 3.13 1.25 



ti ve Evaluation) Purposes 

a. Interpreting test profiles to iden- 
tify pupil strengths and weaknesses 

b. Pretesting to determine required 
instructi onal emphases 

c. Organizing homogeneous groups on the 
basi s of test performance 

d. Using test data to guide remediation 

e. Examining test data to anticipate 
level of class performance 

■f • Using test scores to evaluate 
teaching 

< table cont i nues ) 
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Professors 
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Item Description 



M 




S 


M 




S 




/ 
/ 


3. 


47 


1 


. 03 


3. 55 


1 


.00 


7. Using Test ResuItjS for Summati vf-> 
















Evaluation Purposfees 


'J> m 


64 


1 


.00 


2.87 


1 


.29 


el ■ 


rvecofninenu 1 ng coLinseJiiny or rsnie 


















□ 1 ax. 1 on 




.60 


1 


. 16 


3.26 


1 


.45 


b. 


Assigning -final grades 


3 • 


89 


1 


.06 


3. 17 


1 


.37 


c. 


Identifying gifted pupils or slow 


















learners 


3- 


86 


1 


.ob 


3.39 


1 


.20 


d. 


Identi f ying under-achievers 




51 


1 


. 18 


2.61 


1 


.23 




Recommendi ng promoti on , accel erati on 


















or retar dat i on 




73 


1 


.37 


1.48 


1 


.34 


f . 


Recommend i ng emp 1 oyment 


3. 


14 


1 


.31 


1.96 


1 


.55 


g- 


Recommending higher education 




74 


1 


.36 


2.29 


1 


.31 


8. Testing and the Law. Legal Challenges 
















to 


Test Pract i ces 




48 


1 


.30 


1.91 


1 


.34 


a. 


Court cases and major considerations 


















in those court cases 


2. 


52 


1 




2.00 


1 


.45 


b. 


Major rulings of courts 




83 


1 


.32 


2.29 


1 


.31 


c . 


Legal r estr i ct i ons 




76 


1 


. 37 


2.36 


1 


.29 




1. on use of standardized tests 




90 


1 


.37 


2.41 


1 


.40 




2. on use of teacher prepa*^ed tests 



Note , The rating scale ranged from 0 = no emphasis to 
5 = very great emphasis. 
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Table 2 

Teacher and Pro-fessor (P) Item Means -for the 20 Items 
Teachers Rated Most Highly 

Teacher P Item Descriptions 

Rank Mean Mean 

Preparing Examinations 



1 


4. 


10 




39 


Pr ep ar i n g Ex ami nat i on s 




5 


o . 


90 


3. 


96 


Wri ti ng test items. 




a 


3 . 


86 


3 . 


75 


writing objective items, e.g. , multiple 












choi ce and match i ng items. 




9 


3. 


82 


3. 


74 


Item selection and construction 




10 


•3 • 


80 


3 • 


5^ 


Writing Sub jecti ve test items, e.g. 


, essay 












and short answer items. 




12 


3. 


69 


3. 


71 


Def i n i ng course ob ject i ves 




15 


3. 


67 


3. 


37 


Admi n i ster i ng and scor i ng tests. 




16 


O . 


67 


3 m 


17 


Determining appropriateness of test 


conten 












for speci f ic cl asses. 




18 


s.* m 


63 


3. 


26 


Scor i ng tests- 














Nontest Evaluation Activities 






4. 


06 




39 


Cl ass di scussi on 




4 


4. 


01 


4kl m 


43 


Observing working habits^ 





(table continues) 
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Teacher 


Prof . 


Item Descr i pti ons 


Rank 


Mean 


Mean 




13 


3. 68 


2. 17 


Interpersonal rel ati onships. 


20 


3. 60 


3.40 


Employing other evaluati ve devices. 








Application of Test Results 


T 


4. 05 


3.39 


Interpreting test profiles to identify pupil 








strengths and weaknesses. 




3. 89 


3. 17 


Identifying gifted pupils or slow learners. 


7 


3.86 


3. 39 


Identi f ying under — achi evers- 


11 


3.76 


T / cr 

O m 


Using test results for planning (formative 








eval uat i on ) purposes. 


14 


3.68 


3.04 


Using test data to guide remediation. 


17 


3.64 


2.87 


Recommending counseling or remediation. 


19 


3 . 63 


3.04 


Pretesting to determine required. 



i nstructional emphases. 
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Table 3 

Teacher <T) and Pro-fessor (P) Item Means -for the 20 Items 
Professors Rated Most Highly 



p 




T 




p 


item Descri pt ion 






Mean 


Mean 
















Statistics 




1 


2. 


92 


4. 


00 


Standard scores and tha normal distribution 




^> ■ 


58 


4. 


00 


Measures o-f central tendency 


and 












var iabi 1 i ty- 




5 




Bl 




90 


Computing and interpreting statistical data. 


6 




57 




83 


Correlations and reliability 


coef f ici ents- 


7 


— r 


10 


3. 


83 


Percentages and percentiles- 




19 




14 


o . 


52 


Transforming raw scores. 














Preparation of Ex ami nations 






4. 


10 


4. 


00 


Prepar i ng examinations 




4 


3 . 


90 




96 


Writing test items. 




Q 


^ ■ 


85 


3 • 


79 


Defining skills and taxonomy 


level s (e.g., 












Bloom's taxonomy) . 




9 


3 . 


58 




75 


Objective vs. essay tests. 




10 


*^ 


86 




74 


Wr i t i ng ob ject i ve i terns , e.g. 


, multiple 



choice and matching items. 

(table cont 1 nues ) 
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p 


T 


P 


Item Description 


Rank 


Mean 


Mean 




11 


3.82 


3.74 


Item selection and construction. 


13 


3.69 


3.71 


Defining course objectives. 


IS 


3.80 


3.54 


Writing subjective test items, e.g. , essay 








and short ansMer items. 








Standardized Test Issues 


12 


3. 18 


3.72 


Selection and use o-f standardized and 








publisher prepared tests. 


14 


2.B9 


3.67 


Nprni-ref erenced vs. criterion-referenced 








tests. 


15 


2.93 


3.67 


Test norms and interpretation based upon 








norms. 


20 


3.56 


3.50 


Evaluating tests in terms of reliability and 








val i di ty . 








General Use of Exam Results 


16 


3. 76 


3 . 65 


Using test results for planning (formative 








evaluation) purposes. 


17 


3.47 


T crc- 


Using test results for summative evaluative 



purposes 
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Table 4 



Major Mean Discrepancies (MP) Between Teacher and 



Professor Mean Ratings 



Rank 



MD 



Item Description 



1.67 
1.58 
1.51 
1,24 



Nontest Eval uat i on Act i vi t i es 



Class Di scussions. 



□b ser V i ng war k h ab i t s . 
Interpersonal Relationships. 
Interviewing pupils and parents. 



10 



14 



-1.42 



-1.26 



-1.09 



-1.08 



O- 90 



Stati sti CB 
Measures o-f central tendency and 

var iabi 1 i ty . 
Carrel at i ons and reliability 

coefficients. 
Computing and i nterpreting stati stical 

data. 

Standard scares and the normal 

di stri but i on . 
Determining scoring weights. 

(tab 1 e cant i nues ) 
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Standardized Tests 
Working on test committees. 
Norm referenced vs. Criterion referenced 
tests. 

Use of Test Results 
Test norms and interpretation based upon 
norms. 

Recommendi ng emp 1 oyment 
Recommend i ng hi gher educat i on . 
Recommending tests to administrative 
personnel . 

Recommending promotion, acceleration, or 

retardation- 
Organizing homogeneous groups on the 

basis of test performance. 
Recommend i ng counsel i ng or r emed i at i on . 



(tabl e conti nues) 
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Rank MD Item Description 

Other 

12 -0.94 Defining skills and taxonomy levels 

(e.g.. Bloom's taxonomy). 
16 0.8S Orientation and instructional 

information- 
Note. If MD is positive, teachers had the larger mean rating. 
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